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ABSTRACT 


The objective of this thesis research is to develop a Back-Propagation Neural Network 
(BNN) to control certain classes of unknown nonlinear systems and explore the network's 
capabilities. The structure of the Direct Model Reference Adaptive Controller (DMRAC) 
for Linear Time Invariant (LTI) systems with unknown parameters is first analyzed and then 
is extended to nonlinear systems by using BNN. Nonminimum phase systems, both linear 
and nonlinear, have also been considered. 

The analysis of the experiments shows that the BNN DMRAC gives satisfactory 


results for the representative nonlinear systems considered, while the conventional Jeast- 


squares estimator DMRAC fails. Based on the analysis and experimental findings, some 


general conditions are shown to be required to ensure that this technique is satisfactory. 
These conditions are presented and discussed. It has been found that further research needs 
to be done for the nonminimum phase case in order to guarantee stability and tracking. 
Also, to establish this as a more general and significant control technique, further 
research is required to develop more specific rules and guidelines for the BNN design and 


training. 
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I. INTRODUCTION 


In the past three decades, major advances have been made in adaptive identification 
and control for identifying and controlling Linear Time Invariant LTI systems with unknown 
parameters. The choice of the identifier and controller structures is based on well established 
results in systems theory. The adaptive control theory is then improved and applied to 
nonlinear dynamic plants employing neural networks with the right choice of the identifier 
and controller structures. 

The objective of this thesis research is to develop a Back-Propagation Neural Network 
(BNN) to control certain classes of unknown nonlinear dynamical systems. Initial analysis 
is directed towards a Direct Model Reference Adaptive Controller (DMRAC) for an 
unknown LTI system. Results of the simulation are displayed for this system. The same 
analysis is then performed for an unknown nonminimum phase system. The adaptive contro] 
theory is then applied to nonlinear unknown systems by employing neural networks. 
Basically, four nonlinear models have been analyzed and simulated. Lastly, the adaptive 
control algorithm is tried with nonlinear unknown nonminimum phase systems again by 
employing neural networks. 

In what follows, Chapter II presents the analysis of a DMRAC for an unknown LTI 


system and for an unknown nonminimum phase LTI system. Chapter If] concerns neural! 


networks in general and adaptive control of nonlinear plants employing neural networks. 





Chapter IV is composed of the simulations that have been carried out. The results, 


conclusions and discussions are given in Chapter V. 


Il. ADAPTIVE CONTROL OF UNKNOWN LTI SYSTEMS 


A. ADAPTIVE CONTROL IN GENERAL 

The search for design techniques to control systems with unknown parameters has 
drawn much attention in recent years. Adaptive control is currently one of the most 
commonly used methods in the control of systems with uncertain dynamics. Several 
applications of adaptive control such as ship steering, aircraft control, robot manipulation, 
chemical process control and bio-medical engineering have been performed in the past years 
[Ref.1]. The general structure of an adaptive contro] system is shown in Figure 1. 

During the development of adaptive control, two major classes have emerged: learning 
systems, which lead to the introduction of learning automatons in the control literature, and; 
adaptive systems using a model reference, known as model reference adaptive control 
systems (DMRAC). The design of the adaptation algorithm in DMRAC is based on stability 
theory since the stability of the closed loop system is a fundamental requirement in the 
design of control systems. 

In adaptive control of an unknown linear time invariant (LTI) system, unknown 
parameters are estimated by an on-line estimator. Based on the estimated parameters, an 
adequate design can be achieved to implement the chosen contro! law. This process is 


commonly referred to as indirect adaptive control. Figure 2 shows the structure of an 


indirect adaptive control system. 





On the other hand, it is possible to parametrize the unknown system in terms of the 
control parameters (e.g. the state-feedback gains) to implement the chosen control law. In 
this case, the estimation and the control processes are carried on together. This alternative 
approach is called direct adaptive control. Figure 3 shows the structure of a direct adaptive 


control system. 
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CHOSEN CONTROL LAW 
Figure 1.General Adaptive Control Structure 
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The main idea in all models is the parameter estimation. When estimating the 
parameters, the uncertainties must be expressed linearly in terms of a set of unknown 
parameters. 

In linear systems, the regression can be adequately used to obtain the state 
measurements or observations from the systems, with the unknown parameters as 


coefficients. However, in nonlinear systems, nonlinear functions of the measurements or 


observations are generally required. With unknown nonlinear systems, these nonlinear 
functions cannot be specified most of the time. So, the use of neural networks as generic 


parametric models is highly recommended in such cases. 
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Figure 2. Indirect Adaptive Control Algorithm 
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Figure 3. Direct Adaptive Control Algorithm 





Before developing a neural network based direct model reference adaptive controller 
for unknown nonlinear systems, the design of a DMRAC for unknown LTI systems will be 


presented. 


B. ANALYSIS OF A DMRAC FOR UNKNOWN LTI SYSTEMS 
Consider an LTI system 
AQ Xt) = Bq), (2-1) 
with A(q) and B(q) being polynomial operators! with unknown coefficients. For 
DMRAC design, it is assumed that: 
1. A(qg) is monic and degree[A(q)]=n is known. 
2. The transfer function B(q)/A(q) is strictly proper and the relative degree is known. 


The goal of the controller is to move the closed loop system's dynamics such that 


Dg) X0=0, (2-2) 


where D(q) is the arbitrary monic stable characteristic polynomial operator of the desired 
system, and v(t) is an external input. The general structure of DMRAC is shown in 
Figure 4. 

By employing a steady-state Kalman filter, observer-state-feedback system yields the 


following structure for the feedback controller, 





‘The argument q of the polynomial operators is the forward time-shift operator in discrete 
time modelling. 


ut) = 2D ay + yy « Wo, 


a(q) a(q) 
where a(q) is the monic characteristic polynomial operator of the observer. It can be 
chosen arbitrarily, provided that the roots are in the stable region. The polynomial 


operators h(q) and k(q) are the feedback polynomial operators of the unknown system. 


REFERENCE 
MODEL 
ESTIMATOR 


UNKNOWN B(q) 
CONTROLLER SYSTEM (ZG) 


Figure 4.General Structure of the DMRAC 


By using partial state representation, equation (2-1) can be reconstructed as 


A(q) z(t) = u(t), 
yt) = Bq) xb). 





Combining equations (2-3) and (2-4) yields the following: 


[a(q)A(q) - Mq)Alq) - Hq) Rg)lalt) = a(q) 0), (2-5) 
Wd) = Bat) 


To obtain the desired closed loop behavior, the Diophantine equation 


a (q)A(q) - h(g)A(q) - k(q) Big) = «(QB (2-6) 
1 


should be satisfied so that the closed loop system poles coincide either with those of the 
reference model or with the system's zeros. The factor //b, is needed to ensure that the 
right side of the equation is monic and the roots of B(q) are assumed to be inside the unit 
circle (i.e. minimum phase) [Ref.2]. 

In equation (2-6), the polynomial operators A(q) and B(q) are assumed to be relatively 
co-prime (i.e. there is no pole-zero cancellation) which guarantees a unique solution for h(q) 
and k(q). 

Since A(q) and B(q) are unknown polynomial operators, an estimator is required to 
estimate the system parameters on-line, based on the generic recursive regression analysis. 


The regression equation, 


a(q)u(t) = h(q) u(t) + k(q) yt) + ran Dy, (2-7) 


I 


is obtained by using partial state transformation. 
Using q as the forward time-shift operator, the filtered input and output signals can be 


defined as 


q"a(gy"() = xb, 


(2-8) 
q-"a(q)u*(t) = uft). 
Equation (2-7) can be expressed in a more convenient form as follows: 
-r -(ner, -(ner) 1 eS 
q(t) = q°""*Ph(q) u*(t)+q- Mh 9" +g DQ 0), (2-9) 


1 


where n is the order of the unknown system and r is the number of the closed loop 
system poles which must be placed to match those of the reference model. Equation (2-9) 


can be represented in a matrix formation as 


q u(t) = &"(®@,, 





h, 
where u'(t-r-1) h, 
u' (t-r-2) 
Hine) 
u’(t-r-n+1) k, (2-10) 
®, = 
®(t) =| y*(t-r-1) k, 
y*(t-r-2) 
Ken-1) 
y(t-r-n+1) 1 
Lq Digywh | 5, 








By using the linear regressor @(f) and recursive estimation of @, as 


P(t) @(Y) [u(t-r) - O7() O()] (2-11) 


@(t+1) = @(t)+ 
1+ 61 (t) P(t)® (1) 


and, 


P(t) O() O"(t) P(t) 


P(t+1) = P(t) + , 
1 + ©" (t) P(t) ®(t) 


(2-12) 


the unknown parameters of the system are estimated. Hence, the control equation (2-3) 


can be rewritten as 


ut) = q-"h(qu®(t) +q-" ky *(t) +n, (2-13) 
1 


Equation (2-13) can be represented as 
ut) = ©" (t)@,, 


u*(t-1) | 
u‘(t-2) 
(2-14) 
u*(t-n+1) 
(0 =| y*(t-1) 
y"(t-2) 





y"(t-n+l) 
wt) 


10 


It can be noticed that all the equations in both estimation and control phases are the 
same except for the last element in the vector @(f)and © (t). Hence, this identical structure 
of the estimator and the controller prevents the unnecessary intermediate control design 
calculations and speeds up the control process. 

Figure 5 illustrates the estimation and the control algorithm of the DMRAC where 
e( t) isa linear associative memory with recursive estimation to minimize the mean square 


errors between u(t) and u(t) = ©7()@(t). 





ESTIMATOR 
CONTROLLER 
® 
G7) OBSERVER 


Figure 5.Estimation and Control Algorithm of the DMRAC 








As an illustration, Appendix A contains a numerical example of the design of a 


DMRAC for an unknown minimum phase LTT system. 





C. ANALYSIS OF A DMRAC FOR UNKNOWN NONMINIMUM PHASE LTI 

SYSTEM 

Consider the LTI system in section B, with polynomial operators A(q) and B(q) which 
cause the system to be nonminimum phase (i.e. at least one zero outside the unit circle in z- 
domain). 

Defining the polynomial operators A(qg) and B(q) 


A(q) = q"+aqr+..+@, 
’ (2-15) 


Bq) = q"+ bq") +... +b, 
and using the same structure for the feedback controller in equation (1-3), the following 
Diophantine equation 

h(q) A(q) + k(q) B(q) = a(q)[A(q) - P*(q)], (2-16) 
is obtained. a@(g) is the monic characteristic polynomial operator of the observer. It is 
also arbitrary and stable. P’(q) is the stable arbitrary characteristic polynomial of the 
desired closed loop system. 

In nonminimum phase systems, for estimating h(q) and k(q), the filtered partial state 
z'(t) must be replaced by an estimate in terms of the available signals u“(t) and y(t). This 


can be done using the Bezout identity as follows: 


b(q) B(q) + eq) A(q) = 1, (2-17) 


12 


which holds provided A(q) and B(q) are mutually co-prime polynomials. Hence, defining 


b(q) and e(q) as 


Equation (2-10) can be rewritten as 


a(g)ut) = &7(1)@,, 


| we? (t=r=2) 
u‘(t-r-2) 


u'(t-r-n+1) 
y (t-r-1) 
y*(t-r-2) 





y "(t-r-n+1) 
®(t) = 
u'(t-r-1) 


u* (t-r-2) 
au (t-r-n+1) 
¥(t-r-1) 


y* (t-r-2) 


yt (t-r-n+1) 





where 


u(t) = P*(qa(qgu'(t), 


y(t) = P*(ga(qy *(t). 


So, the algorithm in order to obtain a DMRAC for unknown LTI system is developed 
by using an adaptive pole placement algorithm for unknown nonminimum phase LTI 
systems. 


Using equation (2-17), the state z(t) can be estimated as 


x(t) = o(q) u(t) + Hq) WK) , (2-20) 


which implies that b(q) and e(q) are the parameters of the observer. 


Appendix B contains a numerical example of the design of a DMRAC for an unknown 


nonminimum phase LTI system. 
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Il. NEURAL NETWORKS IN ADAPTIVE CONTROL OF 


UNKNOWN NONLINEAR SYSTEMS 


A. NEURAL NETWORKS IN GENERAL 

Neural networks are composed of many simple elements operating in parallel. These 
elements are inspired by biological nervous systems. The network function is determined 
largely by the connections between the elements [Ref.3]. 

Neural networks have been trained to perform complex functions in various fields of 
applications including pattern recognition, identification, classification, speech, vision and 
control systems. Neural networks have been studied for many years in the hope of achieving 
human-like performance in the fields of speech and image recognition [Ref.4]. Today neural 
networks can be trained to solve problems that are unsuitable to conventional computers 
[Ref.3]. 

A neural network is usually a layered network consisting of an input layer, an output 
layer and at least one layer of nonlinear processing elements. The nonlinear processing 
elements, which sum incoming signals and generate output signals according to some 
predefined function are called neurons. The neurons are connected by terms with variable 
weights. The output of one neuron multiplied by a weight becomes the input of an adjacent 


neuron of the next layer. 


15 





A single neuron with nm inputs is shown in Figure 6. The individual input x(j), 
weighted by the element w(1,j) of the matrix w, are summed to form the weighted inputs to 


the transfer function [7 The neuron has a bias 6 and an output y given by 


y=T [> wij) x@+b). (3-1) 
jel 
As in equation (3-1), the transfer function net input is the sum of the weighted inputs 
and the bias b. This sum is the argument of the transfer function. The weight vector w, and 


the input vector x can be represented as 


x(1) 


x(2) 
w= [WI] 012) wm, x=] |. 3-2) 


x(n) 


x(1) 


x(2) 


x(n) 





Figure 6.A Single Neuron Model 


Two or more of these neurons may be combined in a layer and a particular network 
might contain one or more such layers. First consider a single layer of neurons where each 
element of the input vector x is connected to each neuron input through a weight matrix w. 
In this case, there are m neurons and each of them has a summer and the summer outputs 
form an m element vector z. The transfer function net input is the sum of its appropriately 
weighted inputs and bias b. At the end, all neuron outputs form an m element output vector 
y. A one layer neural network with n inputs and m neurons is shown in Figure 7. So, 


equation (3-1) can be represented for this case as 


y= PT [ILD wij) +i). (3-3) 


is] j=l 


All weights can be represented in matrix format as 


WII) WI,2) ..... wn) 


2 22) saree ‘i 
w2d) w(2,2) w(2,n) (3-4) 


wm,]) wm,2) .... w(m,n) 


where each row represents the weights of one layer. 

A network can have several layers where each layer has a weight matrix w, a bias 
vector b, a weighted input z to the transfer function, and an output vector y. Layers whose 
outputs are the network's outputs are called output layers. All other layers are called hidden 


layers. Commonly an input vector is presented to a network, the outputs are calculated and 


Lv 


an algorithm is applied to determine the weight element changes. However, one may want 
to apply more than one input vector simultaneously and get the network's response to each 


one of them. This operation is called batching and this will not be part of this thesis. 


where, 


b(2) 


n= # inputs 


m= # neurons 





Figure 7. A one Layer Neural Network 


Initialization of the weights and bias elements to small positive and negative values 
provides enough variation in the weights and biases so that neurons in the network start out 
with a range of behaviors that can be taken advantage of by the learning rule. 

Yhere are several learning rules such as Hebbian, Instar, Kohonen, Outstar learning 


rules [Ref.3]. Among the most common learning rules is backpropagation which adjusts the 
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weights and biases of the network in order to minimize the mean squared error criterion. 
This is a gradient algorithm and it is done by continually adjusting the values of the weights 
and biases in the direction of the gradient of an appropriate cost function. 

The most distinctive and appealing feature of many neural networks is that they learn 
by examples. 

Currently, the most popular and commonly used neural networks for control system 
design is the Back-Propagation Neural Networks (BNN) which is discussed in the following 


section. 


B. ANALYSIS OF BACK-PROPAGATION NEURAL NETWORKS 

A Back-Propagation Neural Network is a multilayer, feed-forward network which has 
an input layer, an output layer and at least one hidden layer. Neurons are found in the output 
and hidden layer(s), while the input layer has only input connections feeding the neurons in 
the first hidden layer. There exists no feedback or even interconnection between neurons in 
the same layer. There is generally a bias input for each neuron with an associated non-zero 
weight. A three layer network is shown in Figure 8. It has one output layer and two hidden 
layers. 

In Figure 8, wi(j,k) is defined as the connection weight for the path from the j” neuron 
in the (i-J) layer to the k* neuron in the “ layer. It is assumed that the weight vector wi(j,k) 
is a constant so that a partial derivative can be rigorously defined [Ref.5]. Then the BNN in 


Figure 8 can be represented mathematically as 


1.9 





93 = DAw9e 2 Gude F(wilax 4 61+ 82)4 BI), 3-3) 


where x(j) is the input vector to the BNN. 
In the learning process, the BNN adjusts weights wi for all i, to minimize a suitable 


function of the error between the output y and a desired output y, where yt) is the signal to 


be approximated given the input x(t), ¢ = I,....,N. 





Figure 8.A Multilayer Neural Network 


The most common error function used is 


I N 
i-— Y pxtle)- y fon’, (3-6) 
t=] 
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where y{#) is the signal to be approximated given the input x(t), t= /,....,N and yt|@) the 
output of the neural network with parameter @. A general picture of this mechanism is 
given in Figure 9. 

The BNN implements a modification of the gradient descent algorithm to update each 


weight at time ¢+/ as follows: 


1 ale? 
@(t+ 1) = @(t) - u— ———_|_ , ‘ 
(t+ 1) = @(t) BS - [. (3-7) 


where yz > 0 is the learning rate and ® is the weight vector and e(t) = y(t\®) - y ft). 


OUTPUT 


INPUT 
x(t) NONLINEAR SYSTEM 
yd(t) 






Figure 9. Obtaining the Error in BNN 


A general flow diagram for a BNN is given in Figure 10. The algorithm that is used 
to simulate the BNN is presented below. The MATLAB codes to accomplish this task are 


in Appendix C. For simulation purposes, a two layer neural network is chosen which is quite 
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effective compared with three or more layered networks. A sigmoid is used as the nonlinear 


transfer function. 


od 
0a... 
ty 
Figure 10. A General Flow Diagram for BNN 





In order to derive the back-propagation algorithm, the weights, biases, input and output are 


defined as 





(3-8) 
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The input to the first layer is given as 


v’ = wieu+ bl 


(3-9) 
com: bi t P| vy 


ua, +b) \y! 
v, 


Hence, the output of the first layer, vy can be represented as 


vy 
v= sigmoid (v’) = . 
8 | (3-10) 


Likewise, the input to the second layer is given as 


z’= w2*v+b2 
_|n C1 b,, 1+ CV, +b,, Zz) (3-11) 
Cr) 22 Cy )¥1 + Cy2¥, + by, zs 


And the output of the second layer can be given as 


zy (3-12) 
z= sigmoid (z’') = 
Ke 
So, the output of the network can be written as 
zy - 
yew 2h, fy! z =Syy*%, + Sig*% - (3-13) 
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The error can be calculated as 


e=y-y,= a3. 


Back-Propagating the error through the network, the followings are obtained: 


d2 = diag (w3'« d3)«dsig (z) 


di = diag (w2'«d2)*dsig (v’). 


(3-14) 


(3-15) 


Partial differentiations of an appropriate cost function with respect to all weight vectors 


multiplied by a proper learning rate are subtracted from the weight vectors of the 


previous iteration to obtain the new weight vectors as follows: 


J w3- u(d3 +2’) 
aw? 





w3= w3- Ub * 


= w2~- u(d2« v’) 





w2=w2- pu * as 
aw2 





wl=wi-e x 2 
awl] 


In a similar way, we can show that for the bias terms, 
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=wl-yu(dlx«u’). 


(3-16) 


ae bt pe 2 
ab2 


= b2-yxd2, 


bi ati. bl-yxdl. 
abl 





IV. APPLICATIONS AND SIMULATIONS 


A. EXPERIMENTING WITH THE BNN DMRAC FOR UNKNOWN MINIMUM 

PHASE SYSTEMS 

In this chapter, the results of the experiments using the BNN DMRAC on various 
nonlinear SISO systems, are presented. Four experiments are conducted using software 
simulations for the four classes of unknown nonlinear systems considered in Chapter III. 
The main purpose of these experiments is to see under what conditions the proposed BNN 
DMRAC works. The software simulation programs used in these experiments are listed in 
Appendix C. 

Lastly, an experiment for an unknown nonlinear nonminimum phase system is 
conducted based on the system Model 2. 

As stated in [Ref.1], four important classes of unknown nonlinear SISO systems are 
considered for direct adaptive control using the BNN. They are modeled in discrete-time for 
analysis and simulation. These are the system models used in [Ref.6] for which BNN 
indirect adaptive control has been successfully demonstrated. The four models are the 
following with J", a continuous smooth function: 

(1) Model 1: 


n-1 


wte+I)= S0 a, y(t- k) +0 [(u(t),u(t- 1),...,u(t- m+ 1) (4-1) 
k=O 
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Large mechanical] systems, hard nonlinearities such as input saturation, dead zones or 
backlash are readily described by this model [Ref.1]. 
(2)_Model 2 : 


m-1 


Wt+ 1) =T [Wt Wt- 1),.wt-n+1)] + Yb, u(t- k) (4-2) 
k=O 


The action of viscous drag on an underwater vehicle can be modelled by this model [Ref.1]. 


(3) Model 3 : 


y(t+1) =P [WO wt- 1),..Wt-n+1)] +0 [u(t),u(t- 1),....u(t- m+ 1) (4-3) 
Underwater vehicles subjected to input saturation and viscous drag can be formulated by 
this model [Ref.1]. 


(4) Model 4 : 





Wt+ I)=T [WO t- 1),...W(t- n+ 1),u(t),u(t - 1),...jU(t- m+ 7)] (4-4) 
Bilinear systems are part of this model [Ref.1]. 


1. Experiment 1: System Model 1 
In the first experiment, a nonlinear system described by Model | was controlled 


by a BNN DMRAC. The chosen nonlinear system is given in equation (4-1) as 


wWt+1) = 0.3 y(t) + 0.6 y\t- 1) + ult)’ + 0.3 ult? - 0.4 u(t) . (4-5) 
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According to the suggested training procedures, the BNN was first trained off-line. 
The training set made of a randomly generated training input u(t) and the resulting output 
y(t) of the neural network and also the external input v(t) and the desired response of the 
system, y,,(t) are shown in Figure 11. The Training set consists of 10 000 data points each 
for the input and the output measurements. The external input to the system, w(t) is a sum 
of sinusoids with different magnitudes and frequencies. The BNN was next placed on-line 
to control the system. During the on-line control phase, the BNN recursively learns to adapt 
to the required control structure. The output of the controlled system compared to that of the 
reference model is displayed in Figure 12. The reference model used in the experiment is 


chosen as 


YAt)=O8y (t-1)+v(t-1), (4-6) 
where v(f) is the external input to the system. 
2. Experiment 2: System Model 2 


In the second experiment, a nonlinear system described by Model 2 was used. 


It is governed by 


yt+1) = HO KE- DH) +25) , u(t) . (4-7) 
1 + yt? + y(t-1/ 


The input to the BNN estimator will be the regressor vector @(t) of equation (2-10). 
#(t) in equation (2-14) is the input vector during the control phase. The same reference 


model and the same procedures for off-line training and on-line control-plus-learning 
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were employed. The BNN estimator was first trained off-line with a 10 000 point training 
set. The training input u(é) and the resulting output y(t) of the neural network and also the 


external input, v(¢) and the desired response, y,,(t) of the system are shown in Figure 13. 
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Figure 11. The Training Input, the Resulting Output, the External Input and the Desired 
Response for the System Model 1 
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Figure 12. The Output of the Controlled System Compared To That of the Reference 
Model for System Model 1 
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Figure 13. The Training Input, the Resulting Output, the External Input and the Desired 


Response for the System Model 2 








Finally, the BNN was placed on-line to control, and learn the control structure and 
estimate the parameters simultaneously. As shown in Figure 14, the system with the BNN 
DMRAC successfully tracks the model reference system closely. To optimize the 
performance of the control system, many different learning parameters and training data 
were tried. However, once a trained BNN works, it tracks the reference model reasonably 


well for the inputs with similar characteristics. 


3. Experiment 3: System Model 3 
In this experiment, a nonlinear system described by Model 3 is chosen as 


yt+]) = a u(t? . (4-8) 


1+ y(t? 
The same procedures, used for the previous experiments were conducted. The training 
input u(t) and the resulting output y(t) of the neural network, and also the actual input 
v(t) and the desired response y,,(t) of the system are shown in Figure 15. The comparison 


between the actual system output and the reference model output are displayed in Figure 
16. 
4. Experiment 4: System Model 4 


In the final experiment, the nonlinear system is governed by: 


wit 1) = XO HE=D wlt- 2) ult 1) Uyft=2) + T+ alt) 


4-9 
1 y(t 1% y(t 2) eoet 


Other than the order of the system, there is not much change in the input vectors. Once 


again, the same procedures were applied to this nonlinear model. The training input u(t) and 
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Figure 14. The Output of the Controlled System Compared To That of the Reference 
Model for System Model 2 
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the resulting output y(¢) of the neural network, and also the actual input v(é) and the desired 
response y,,(#) are displayed in Figure 17. The output of the controlled system compared to 


that of the reference model is displayed in Figure 18. 
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Figure 15.The Training Input, the Resulting Output, the External Input and the 
Desired Response for the System Model 3 
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Figure 16 .The Output of the Controlled System Compared To That of the Reference 
Model for System Model 3 
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DESIRED OUTPUT {--} vs ACTUAL OUTPUT {..} 








Figure 18.The Output of the Controlled System Compared To That of the Reference 
Mode! for System Model 4 
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B. EXPERIMENTING WITH THE BNN DMRAC FOR UNKNOWN 

NONMINIMUM PHASE SYSTEMS 

In this section, the BNN DMRAC is extended to nonminimum phase systems. For 
experiment purposes, Model 2 is considered. 

The pole placement algorithm [Ref.7] which was used for LTI nonminimum phase 
systems was tested on nonlinear nonminimum phase systems. But the experiment showed 
that nonlinearities affect the results and cause the system to go unstable especially in the 
nonminimum phase case. 

So, it was decided not to consider the pole placement approach for this case and the 
BNN DMRAC is directly applied to a nonlinear nonminimum phase system. The simulations 
yield fairly reasonable results which are displayed in Figure 19 and in Figure 20. 

The unknown nonlinear nonminimum phase system used is chosen as 


yee) = MOKED + 25] Id) + 2.5] +u(t)+2u(t-1). (4-6) 


1+ tp + y(t- 1)? 
From the following plots we see that in some cases the output of the system is able to 
track the desired reference signal. However, in several runs, the nonminimum phase system 
does not perform as desired and it might become unstable. The extension of a neural network 


controller to this class of systems is still an open problem in the control system literature. 
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Figure 19. The Training Input, the Resulting Output, the External Input and the 
Desired Response for the Nonlinear Nonminimum Phase Unknown System 
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Figure 20. The Output of the Controlled System Compared To That of the Reference 
Model for the Nonlinear Nonminimum Phase System 
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C. IMPLEMENTATION OF THE BNN SOFTWARE SIMULATOR 

For this thesis research, a software BNN simulator was developed. Until neural 
network hardware systems or neurocomputers become commonly (and economically) 
available, most researchers will work with software simulators for neural networks. The 
software approach offers the full flexibity for development, allowing the user to exercise and 
experiment freely with the various features of the neural network [Ref.1]. The main 
drawback of this approach is the slow learning process of the simulator. 

By using the appropriate MATLAB’ codes, the following functions have been 
developed and used in simulations: 

* INITIAL 2 : This function initializes the BNN. It takes as input variables the 
parameters describing the number of inputs and number of outputs, number of neurons in 
the hidden layer(s). It yields as outputs the initial conditions of the weights and biases of the 
network. 

* NET 2: This function sets up the data structure for a 2 layer BNN. It takes as input 
variables a parameter describing the number of inputs and neurons in the hidden and output 
layer. It also takes as another input variable a parameter specifying the spreading range of 
the biased inputs. 

* BP 2: This function backpropagates the output errors to the input layer. It takes as 
input variables the parameter describing the weights and biases, and the random input, 
learning rate and the actual input. It yields updated weights and biases. 


* SIGMOID : This function introduces the nonlinearity to the neural network. 


* DSIG : This function takes the derivative of the sigmoid function. 


~MATLAB is a registered trademark of the Math Works, Inc. 
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These functions can be all implemented recursively, and imbedded in any iterative 
loop. The source codes of these functions used in this thesis research are provided in 


Appendix C. 
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V. OBSERVATIONS, DISCUSSIONS AND CONCLUSIONS 


A. SUMMARY 

A Direct Model Reference Adaptive Controller (DMRAC) for LTI systems was 
developed and applied to both minimum phase and nonminimum phase LTI systems for pole 
placement. The DMRAC for LTI systems was then extended to nonlinear systems by 
training a BNN to emulate a suitable nonlinear regression form that describes the system 
under consideration. 

Later, the control of four general classes of unknown nonlinear systems, modelled in 
discrete-time, using the BNN DMRAC was considered. Lastly, an experiment for unknown 


nonlinear nonminimum phase systems was conducted. 


B. OBSERVATIONS, DISCUSSIONS AND CONCLUSIONS 

A DMRAC designed with a least-squares estimator, assuming the system is LTT, failed 
to work for most nonlinear systems. Hence, the BNN DMRAC is an effective technique in 
controlling nonlinear systems where the conventional technique fails. 

Some experiments conducted showed that the BNN DMRAC performs its contro] 
function reasonably well for various types of inputs. However, for inputs with high 
frequency components (with respect to the sampling rate), the controlled system became 
quite oscillatory, so that it would not be able to track the reference model properly. In 


addition, the controller would sometimes saturate during training and therefore it failed to 
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control the system [Ref.1]. It is suggested that the solution to this problem is to increase the 
sampling rate with or without increasing the number of neurons and hidden layers used in 
the BNN. Off-line training with more appropriate data (i.e. training signals containing 
similar frequency characteristics as the actual signals experienced by the controlled system), 
and adjusting the learning parameters also helps to improve the tracking performance and 
avoid saturation [Ref.1]. Unfortunately, there is still no general rule to help select the most 
appropriate learning parameters. Hence, a great deal of experimentation is usually required. 

The BNN DMRAC designed for unknown nonminimum phase systems is not 
promising compared with their minimum phase counterpart. A nonminimum phase system 


is usually hard to stabilize adaptively by a direct approach. 


C. FURTHER RESEARCH AND DEVELOPMENT 

In this thesis research, the emphasis was to develop a structure for direct adaptive 
control of certain classes of unknown nonlinear systems using the BNN. The results of the 
experiments clearly showed that the BNN DMRAC is very effective in controlling unknown 
nonlinear systems. On the other hand, nonminimum phase systems at the moment do not 
yield satisfactory results and need more research efforts. 

Once more, in the design of the BNN, the selection of the number of layers, neurons, 
the type of nonlinear transformation, etc., is still at the discretion of the designer. This area 


definitely needs further research. 
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APPENDIX A 


DMRAC DESIGN FOR UNKNOWN MINIMUM PHASE LTI 


SYSTEMS 


In this Appendix we address the problem of designing an adaptive controller for a LT] 
system based on the algebraic approach. The unknown LTI system to be controlled is 


described by the following polynomial operators: 


Aq) q?- 0.2q+0.9, 


(A-]) 
B(q) = 3q. 
In order to track the reference model 
y(t) - O.8y (t-1) = wt-1), (A-2) 


the desired closed loop polynomial, P°(g) and the characteristic polynomial of the 
observer, a(g) are chosen arbitrarily and with roots in the stable region as 


P"(q) = J3qq - 0.8) = q(q - 0.8), 
. (A-3) 


a(qg) = q?-q+0.25. 
Using the algorithm in Chapter | Section B, the Diophantine equation can be formed 
as follows: 


(h,q + h,)(q? - 0.24 + 0.9) + (kg + k,)3q = (q? - q+ 0.25) [g - 0.24 + 0.9 - g(q- ne 
-4) 
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or 


(h,q + h,)(q? - 0.2q + 0.9) + (k,q + k,)3q = 0.6q° + 0.3q? - 0.75q+0.225.  (A-5) 


The MATLAB simulation for the above system yields the following steady-state gains: 


h, = 0.61234, 
h, = 0.24935, 
k, = 0.05678, 
k, = -0.42982. 


nN 


These results can be checked by using the Sylvester matrix formulations for this 


system which in this case is given by 


1 0 0 0 fh 0.6 
-0.2 1 3 : hl | 03 (A-6) 
0.9 -0.2 0 3] \k, -0.75 

0 09 0 O |x, 0.225 


By solving equation (A-6) we obtain the following values: 


h, = 0.6, 
h, = 0.25, 

k, = 0.0567, 
k, = -0.4133. 


Hence, the estimated gains needed to control the system are found to be very close to 


the steady-state gains obtained from the Sylvester matrix equations. 
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The simulation results of the experiment for the system above are displayed in Figures 
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Figure A-1. Reference Input and Estimated Control Parameters for LT] Minimum 
Phase Unknown System 
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' del *.met 

clear,clg,cle 


% Generating the reference input signal 
t=0: 100; 
v=square(t/(().5*pi)); 
kmax=length(v)+1; 
for i=I:kmax-1 

if v(i)<=0) 

v(i)=0; 

end 

end 


% Initialization of the parameters 
u=zeros(1,kmax); 
ubar=zeros(1,kmax); 
y=zeros(1,kmax); 
ybar=zeros(1,kmax); 
ym=zeros(1,kmax); 
th=zeros(5,kmax); 

th(5,4)=0.1; 

P=10000*eye(5); 


% Generating the reference model 


for j=2:kmax- | 
ym(j)=0.8*ym@-1)+vq-1); 
end 


% Designing the controller 
for k=4:kmax- | 
hl=th(1,k); 
h2=th(2,k); 
k1=th(3,k); 
k2=th(4,k); 
1b 1=th(5,k); 
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u(k)=(h 1+1) *u(k-1)+(h2-0.25)*u(k-2)+k 1 *y(k-1)+k2*y(k-2)+... 
(v(k)-v(k-1)+0.25*v(k-2))*ib 1; 

ubar(k)=u(k- 1 )-u(k-2)+0.25*u(k-3); 
y(k)=0.2*y(k- 1 )-0.9* y(k-2)+3*u(k- 1); 
ybar(k)=y(k)-1.8*y(k-1)+1.05*y(k-2)-0.2*y(k-3); 
phit(k,:)=[u(k-2) u(k-3) y(k-2) y(k-3) ybar(k)]; 
K(,k)=(P*phit(k,:)')/(1 +phit(k,:)*P*phit(k,:)'); 
P=P-(P*phit(k,:)'*phit(k,:)*P)/(1+phit(k,:)*P*phit(k,:)'); 
th(:,k+1)=th(:,k)+K(:,k)*(ubar(k)-phit(k,:)*th(:,k)); 

end 


% Plotting the results 
subplou(211) 

axis({Q 100 -0.1 1.2]); 
plot(v(1,1:100)); 
tile("REFERENCE INPUT v(t)’); 
xlabel('TIME (Sec.)'); 

ylabel( MAGNITUDE)); 
grid,pause 

axis 

clear t 


t=1:100; 
subplot(212) 


plot(t,th(1,1:100),t,th(2, 1:100),t,th(3, 1:100),t,th(4, 1:100),... 
t,th(5,1:100)); 

uitle(ESTIMATED CONTROL PARAMETERS’); 

xlabel('TIME (Sec.)'); 

ylabel( MAGNITUDE’); 

grid 

gtext(‘hl'); 

gtext(‘l/b1’); 

gtext(‘h2'); 

gtext('k1'); 

vtext('k2'); 

pause 

meta | 

cly 

subplou(211) 

plot(ym(1,1:100)); 

utle("THE SIGNAL TO BE FOLLOWED’); 

xlabel('TIME (Sec.)'); 

ylabel(’MAGNITUDE)); 


$1 





grid,pause 

subplot(212) 

plot(y(1,1:100)); 

tite(THE OUTPUT OF OUR PLANT); 
xlabel(‘TIME (Sec.)'); 

ylabel( MAGNITUDE); 

grid,pause 

meta 2 

clg 

axis({Q 100 -2 4]) 

plot(ym(1,1:100)); 

hold on 

plot(y(1, 1: 100),'*'); 

plot(y(1,1:100)); 

ttle(’ DESIRED OUTPUT ym(t) {--} vs ACTUAL OUTPUT y(t) {**}'); 
xlabel("TIME (Sec.)'); 

ylabel( MAGNITUDE); 

grid 

hold off 

meta 3 

axis(‘normal’); 

GeV Vo Vo Vo Vo We Ve Vo Ue Vo Yo Yo Vo Vo To Yo Yo Vo Go Yo Yo Yo Yo Vo Yo Yo Yo Yo Vo Ve Yo Yo Yo To Yo Ves Vo To Vo Yo Ve Ge 
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APPENDIX B 
DMRAC DESIGN FOR UNKNOWN NONMINIMUM PHASE 


LTI SYSTEMS 


In this Appendix we address the problem of designing an adaptive controller for a 
nonminimum phase LTI system based on the algebraic approach. The unknown LT] system 
to be controlled is described by the following polynomials: 


A(q) = 4? +q+1, 
(B-1) 


Bq) = q+2, 
where A(q) is stable and monic, and B(q) is monic and causes the system to be 
nonminimum phase. In order to track the reference model 
Y(t) - 0.8y,,(t-1) = Wt-)), (B-2) 
the desired closed loop polynomial, P’(q) and the characteristic polynomial of the 
observer, @(q) are chosen arbitrarily as 


P*(q) = (q- 0.8), = 


a(q) = q?-q+0.25. 


Using the algorithm in Chapter | Section C, the Diophantine equation can be formed 


as follows: (B-4) 
(hq +h,)(q?+q+1)+ (k,q + k,)(q + 2) (q? - q+ 0.25) [q? +.q+1-(q? - 1.6q + 0.64)], 
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or, 


(h,q +h,)(q? + q+ 1) + (k,q+k,(q +2) = 1.89? - 0.96q? - 0.39q + 0.21. (B-5) 


Also, P-*(q)a(q)is obtained as 


(q? - q + 0.25)(q? - 0.8q + 0.16) = q*- 1.8q? + 1.21q? - 0.364 + 0.04. (B-6) 


The MATLAB simulation for the above system yields the following gains, 


h, = 1.7927 
h, = -2.121 
k, = -0.6184 
k, = 1.1666 
b, = 0.0000 
b, = -0.6642 
c, = 0.6642 
c, = 0.3358. 


where b,, b,, c, and c, are the coefficients of the polynomials b(q) and c(q) respectively. 
These results can be checked with Sylvester matrix solution for this system which can 


be written as 


= . (B-7) 
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By solving equation (B-7) we obtain the following values: 


h,= 18, 
h, = -2.15, 
k, = -0.61, 
k= 118. 


Hence, the estimated gains needed to control the system are found to be very close 
to the gains obtained from Sylvester matrix equations. In fact, only h,, h,, k, and k, are 
needed in the control phase while b,, b,, ec; and c, are needed only in the estimation phase 
of the entire design process. 

The simulation results of the experiment for the system above are displayed in Figures 
(B-1) thru (B-3). 

The experiment then, is conducted for an unknown nonminimum phase and unstable 
LTI system. It is observed that the algorithm first stabilizes the system with the adaptive pole 
placement process and then provides the necessary input to track the desired reference 
signal. The results of this experiment for the nonminimum phase and unstable system are 


displayed in Figures (B-4) thru (B-6). For the experiment A(q) is chosen to be as follows: 


A(q) = q? +q+2. (B-8) 
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TeV Ve Ve Ve Ve To Vo Vo Vo Vo Vo Vo Te We Vo To Yo Yo Lo Vo Yo Yo Ve To Ve To Yo Yo Vo Le Vo To Vo Yo Yo Yo Yo To Vo Ve Ve Ve Ge 
% 


% ADMRAC DESIGN FOR UNKNOWN NONMINIMUM PHASE LTI SYSTEMS 


% Designing a controller using recursive least squares estimates 
% — of the compensator parameters. 


% Experimenting with a stable system 

% 

LAGU Ve Vo Ue Vo Vo Vo To Vo To Yo Vo Yo Vo To Yo Yo Yo Lo Yo Yo Yo Vo To Yo Yo To To To To Yo Go Yo Fo Lo ho Vo Vo Ue Ne 
vi 


' del *.met 
clear,clg,cle 


% Generating the reference input signal 


w= 10000; 
t=0:w; 
v=square(U(40* pi)); 


% Initialization of the parameters 


kmax=length(v)+1; 
u=zeros(1,kmax); 
ubar=zeros(1,kmax); 
uF=zeros(1,kmax); 
uFmin]=zeros(1,kmax); 
y=zeros(1,kmax); 
yF=zeros(1,kmax); 
yFmin1=zeros(1,kmax); 
ym=zeros(1,kmax); 
th=zeros(8,kmax); 
P=wteye(8); 


% Generating the reference model 


for j=2:kmax- 1] 
ym (j)=0.8* ym(j- L)+v(j-1)5 


end 
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% Designing the controller 


for k=6:kmax- | 
hl=th(1,k); 
h2=th(2,k); 
k1=th(3,k); 
k2=th(4,k); 
bl=th(5,k); 
b2=th(6,k); 
c1l=th(7,k); 
c2=th(8,k); 
u(k)=(h1+1)*u(k-1)+(h2-0.25) *u(k-2)+k 1 *y(k- 1)+k2*y(k-2)+v(k)-v(k- 1)+0.25*v(k-2); 
ubar(k)=u(k-2)-u(k-3)+0.25*u(k-4); 
uF(k)=u(k)-1.8*u(k-1)+1.21 *u(k-2)-0.36*u(k-3)+0.04*u(k-4); 
uFmin1(k)=u(k-1)-1.8*u(k-2)+1.2 1 *u(k-3)-0.36*u(k-4)+0.04*u(k-5); 
y(k)=-y(k-1)-y(k-2)+u(k-1)+2*u(k-2); 
yF(k)=y(k)-1.8*y(k-1)+1.21*y(k-2)-0.36*y(k-3)+0.04* y(k-4); 
yFmin 1 (k)=y(k-1)-1.8*y(k-2)+1.21*y(k-3)-0.36*y(k-4)+0.04*y(k-5); 
phit(k,:)=[u(k-3) u(k-4) y(k-3) y(k-4) uF(k) uFmin1(k) yF(k) yFmin1(k)]; 
K(:,k)=(P*phit(k,:)')/(1+phit(k,:)*P*phit(k,:)'); 
P=P-(P*phit(k,:)'*phit(k,:)*P)/(1+phit(k,:)*P*phit(k,:)'); 
th(:,k+1)=th(:,k)+K(:,k)*(ubar(k)-phit(k,:)*th(:,k)); 

end 


% Plotting the results 

subplot(211) 

axis([() w -1.5 1.5]); 

plot(v(1,1:w)); 

title((REFERENCE INPUT v(t)'); 

xlabel('TIME (Sec.)'); 

ylabel(MAGNITUDE’); 

grid,pause 

axis 

clear t 

subplot(212) 

t=l:w; 

plot(t,th(1,1:w),t,th(2, 1:w),th(3, 1:w),tth(4, }:w),... 
t,th(5,1:w),t,th(6, 1:w),t,th(7,1:w),t,th(8, l:w)); 

tide( ESTIMATED CONTROL PARAMETERS’); 

xlabel('TIME (Sec.)'); 

ylabel(MAGNITUDE)), 

grid; 

gtext(h!'); 

gtext(‘k2'); 
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gtext(cl'); 
glext(c2'); 
gtext('b1'); 
gtext(k1'); 
gtext('b2'); 
gtext(‘h2’); 
pause 
meta lb 


clg 

subplot(211) 

axis({0 w -10 L0J); 

plot(ym(1,7:w)); 

tite(THE SIGNAL TO BE FOLLOWED), 
xlabel(TIME (Sec.)'); 

ylabel (MAGNITUDE); 

grid, pause 

subplot(212) 

plot(y(l, l:w)); 

title(THE OUTPUT OF OUR PLANT’); 
xlabel(‘TIME (Sec.)'); 
ylabel(MAGNITUDE); 

grid,pause 

mela 2b 


clg 

axis([0 w -6 6]); 
plot(ym(1,l:w)); 

hold on 

plot(0.625*y(1, Lw),'+'); 
plot(0.625*y(1,1:w)); 
utle( DESIRED OUTPUT ym(t){--} vs ACTUAL OUTPUT y(t){++}'); 
xlabel( TIME (Sec.)'); 
ylabelC MAGNITUDE); 
grid 

hald off 

meta 3b 

axis 


Te Ge GoGo Go VeVi Ve Ve Ve Vo Vo Uo Vo Wo Fo Go Vo Ye Vo Ue We Fo Vo To To Ve Yo Vo Fo Vo He Ge Yo Fo Yo Yo Vo Ge Vo Ge Ki Ge 
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Yo Ge %: To To Yo Vo Vo Ve Yo Ye To Yo To To Yo To Vo Yo To To Yo To Vo To Yo Fo To To To Yo To Yo To To Vo Vo Vo Vo Fo Yo Go Ue G 
- 

Ye 

4% MA DMRAC DESIGN FOR UNKNOWN NONMINIMUM PHASE LTI SYSTEMS 

% 

% Designing a controller using recursive least Squares estimates 

% — of the compensator parameters. 

% 

‘% Experimenting with a unstable system 

% 

WGI GIGI DG Vo Vo Vo Go % Vo Vo Fo Yo To To Vo Fo Yo Vo To Vo Vo Go Yo Vo Vo Lo VO V0 Lo Vo Gi So Vi Ve oe 
% 

! del *.met 

clear,clg,cle 


% Generating the reference input signal 
w=10000; 

t=0:w; 

v=square(U/(40*pi)); 

% \nitialization of the parameters 
kmax=length(v)+1; 
u=zeros(1,kmax); 
ubar=zeros(1,kmax); 
uF=zeros(1,kmax); 
uFmin1=zeros(1,kmax); 
y=zeros(1,kmax); 
yF=zeros(1,kmax); 

yFmin | =zeros(1,kmax); 
ym=zeros(1,kmax); 
th=zeros(&,kmax); 


P=w*eye(8); 

% Generating the reference model 

for j=2:kmax- | 
ym(j)=0.8*ym(-1)+vQ-1); 

end 


% Designing the controller 

for k=6:kmax- | 
hl=th(1,k);h2=th(2,k);k 1=th(3,k); k2=th(4,k); bl=th(5,k);b2=th(6,k); 
cl=th(7,k);c2=th(8,k); 
u(k)=(h1+1)*u(k-1)+(h2-0.25)*u(k-2)+k | *y(k-1)+k2* y(k-2)+v(k)-v(k-1)+0.25 *v(k-2); 
ubar(k)=u(k-2)-u(k-3)+0.25 *u(k-4); 
uF(k)=u(k)-1.8*u(k-1)+1.21 *u(k-2)-0.36*u(k-3)+0.04*u(k-4); 
uFmin1(k)=u(k-1)-1.8*u(k-2)+1.21 *u(k-3)-0.36*u(k-4)+0.04*u(k-5): 
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y(k)=-y(k-1)-2*y(k-2)+u(k-1)+2*u(k-2); 

yF(k)=y(k)-1.8*y(k-1)+1.21 *y(k-2)-0.36*y(k-3)+0.04*y(k-4): 

yFmin I (k)=y(k-1)-1.8*y(k-2)+ 1.21 *y(k-3)-0.36*y(k-4)+0.04*y(k-5): 
phit(k,:)=[u(k-3) u(k-4) y(k-3) y(k-4) uF(k) uFmin1(k) yF(k) yEmin1(k)]; 
K(-,k)=(P* phit(k, :)’)/(1+phit(k, :)*P*phit(k,:)); 

P=P-(P* phit(k,.)*phit(k,:)*P)/(1+phit(k, :)*P*phit(k, .)'); 
th(:,k+1)=th(:,k)+K(:,k)*(ubar(k)-phit(k,:)*th(:,k)); 


end 


% Plotting the results 

subplot(21 1) 

axis([O w -1.5 1.5]); 

plot(v(1,1:w)); 

title"/REFERENCE INPUT v(t)'); 

xlabel('TIME (Sec.)'); 

ylabel((MAGNITUDE)), 

grid, pause 

axis 

clear t 

subplot(212) 

t=l:w; 

plot(t,th(1,]:w),t,th(2,1:w),t,th(3,1:w),t,th(4, 1:w),... 
t,th(S,1:w),t,th(6, 1:w),t,th(7, 1:w),t,th(8, 1:w)); 

title(ESTIMATED CONTROL PARAMETERS’), 

xlabel('TIME (Sec.)'); 

ylabel(MAGNITUDE'), 

grid; 

gtext(‘hl'):gtext(’h2'):gtext('cl');gtext('c2'); 

gtext('b1');gtext('b2'): gtext('k2'):gtext('k 1’); 

pause 

meta 1b 

clg 

subplot(21 1) 

axis([O w -10 10]); 

plot(ym(1,7:w)); 

title’THE SIGNAL TO BE FOLLOWED’), 

xlabel('TIME (Sec.)'); 

ylabel((MAGNITUDE)), 

grid pause 

subplot(212) 

plot(y(1,1:w)); 

title(THE OUTPUT OF OUR PLANT’); 
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xlabel('TIME (Sec.)'); 

ylabel(MAGNITUDE’), 

grid,pause 

meta 2b 

clg 

axis([O w -6 6]); 

plot(ym(1,1:w)); 

hold on 

plot(0.625*y(1,1:w),'t'), 

plot(0.625*y(1,1:w)). 

title((DESIRED OUTPUT ym(t){--} vs ACTUAL OUTPUT y(t)'{++}), 
xlabel('TIME (Sec.)'); 

ylabel( MAGNITUDE), 

grid 

hold off 

meta 3b 

axis 
YAYU*UwUVWUWVVAVMVW%%V%%%VMVW%V%V%0%%%%%%0%%%%%0%%A%%%%%%%0%% 
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APPENDIX C 


BNN SOFTWARE SIMULATOR 


ZeVe Ve Ve Po Fo To Ve ‘Te To To To To To To To To To Lo To ‘To ‘Te ‘To To To Vo To To To To To To To To To To To Te Fo To Fe Le Ne 
% 

J, SOFTWARE SIMULATOR FOR BNN 

% 

FTV Te To Ve To Fe Ye ‘To To To To To To Yo Yo To To To To To Fo To To To Yo Fo To To To To To To Yo Fo To To To Fo Po Yo Ve 
! del *.met 

clear,clg,cle 

rand(‘normal'); 


% Random input to train the system. 
num=15000; 

ul=rand(1,num); 

k=0:num; 


% Actual input to the system. 
v=0.04*sin(k/(9*pi))-0.03*cos(k/(S*pi)); 
kmax=length(v); 


% Calculating the reference output. 

ym=zeros(1,kmax-1); 

y=zeros(1,kmax-1); 

utm1=zeros(1,kmax-1); 

mu=0,04,; 

for i=2:kmax-] 
ym(i)=0.8*ym(i-1)+v(i-1); 

end 

for j=2:kmax- 1 
% modell 
yG+1)=0.5*yG)+0.4*y(j-1)+u1 G)43+0.3*ul (j)42-0. 1 *ul (j); 
%model2 y(j+1)=(y()*yG-1)*(yG)+2.5)/(1+y(G)42+y(Gj-1)%2))+ul (j); 
Jemodel3 y(j+])=yG)/(1+y(j)42)+ul (3; 
Femodel4 y(j+1)=(y(j)*yG-1)*yG-2)*ul G-1)*(yG-2)+1)+ul (j))/(1 +yG-1)42+yG-2)2): 

end 

% Initialization of the neural network. 
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[w3,w2,b2,w1,b1]=initial2(1,18,18,5); 


% Training the neural network. 
for t=4:kmax-1 
u=[ul (t-2) ul (t-3) y(t-2) y(t-3) (y(t)-0.8* y(t-1))]'; 
utm 1 (t-1)=net2(w3,w2,b2,w1,bl,u); 
[w3,w2,b2,w1,b]]=bp2(w3,w2,b2,w1,b1,u,ul(t-1),mu); 
end 


% Plots of the desired signals. 
subplot(221) 

axis([0 200 -4 4]); 

plot(ul (1:200)); 

title(; RANDOM INPUT’); 
grid 


subplot(222) 

axis({O 200 -4 4]); 

plot(utm1(1:200)); 

title OUTPUT OF THE NEURAL NETWORK’); 
grid 


subplot(223) 

axis({Q 1000 -0.1 0.1]); 
plot(v(1:1000)); 

title INPUT’); 

grid 


subplot(224) 

axis([O 1000 -0.4 0.4]); 
plot(ym(1:1000)); 

title(’ DESIRED RESPONSE’); 
grid,pause 

meta h 


% Controlling the plant with a neural identifier-controller. 
clg 

clear u; clear y;clear t; 

u=zeros(1,10000); 

y=zeros(1,10000); 

il=zeros(5, 10000); 

i2=zeros(5,10000); 
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for t=4:9999- 
i1¢,t)=[u(t-1) u(t-2) y(t-1) y(t-2) vio)’; 
u(t)=net2(w3,w2,b2,w1,b1,i1(:,t)); 
y(t)=(y(t- 1) *y(t-2)/(1 +y(t- 124+ (t-2)42))+u(t- 1); 
i2(:,t)=[u(t-2) u(t-3) y(t-2) y(t-3) (y(t)-0.8* y(t-1))]'5 
[w3,w2,b2,w1!,b1]=bp2(w3,w2,b2,w 1,b1,i2(:,t),u(t-1),mu); 
end 


% Ploting the results. 


subplot(111) 

n=1:999, 

axis([0 1000 -0.6 0.6]); 

plot(n,ym(1,1:999),n,y(1,1:999),".'); 

title DESIRED OUTPUT {--} vs ACTUAL OUTPUT {..}'); 
grid,pause 

meta hh 


FoVeDo Vo To Ve Fo Ke Le Fo To To Vo Vo Vo Ve Vo Yo Vo To Vo To To To To Vo Te Yo Yo To To To To To To Yo To To Yo Yo Fo Vo Vo Vo 


Te Fe Te Po To To Ve Yo To To To To To To Vo To Yo To To To Yo Yo Yo To To To Yo To To To Ve Yo Yo Yo Yo Fo Fe To Yo To Lo Vo 
aca 

% 

% BNN DMRAC WITH UNKNOWN NONLINEAR NONMINIMUM 

% PHASE SYSTEM 


FeV Te Te To Vo Vo Le Ke Vo ‘To To Yo Yo To To To Yo Yo To To Vo Yo Yo To To Yo Yo Yo Ve To Veo Yo Yo Yo Yo To Vo To To Yo To 
he 
! del *.met 


clear,clg,cle 

rand(‘normal’); 

% Random input to train the system. 
num=1000; 

ul=rand(1,500000); 

k=0:num; 

% Actual input to the system. 
v=0.03*square(k/(10*pi)); 
%v=0.04* sin(k/(9*pi))-0.03*cos(k/(5*pi)); 
kmax=length(v); 

% Calculating the reference output. 
ym=zeros(1,kmax-1); 


70 


y=zeros(1,kmax-1); 
utm1]=zeros(1,.kmax-1); 
mu=0.099; 
for i=2:kmax-1 
ym(i)=0.8*ym(i-1)+vG-1); 
end 
for j=2:kmax-1 
Femodel2 
y(jt D=(yG)*¥G-1)*(yG)+2.5)/(1+y(G)42+yG-1)42))+ul G)+2*ul Gj-1); 
%emodel3 yQ§+l=yG)/(1+yG)%2)+u1 G43; 
Ymodel4 y(j+1)=(yG)*yG- 1) *yG-2)*ul G-1)*(yG-2)+1)+u1 G))/C+yG-1)42+yG-2)42); 
J%emodell y(j+1)=0.5* yG)+0.4*yG-1)+ul G)43+0.3*ul G)42-0.1*ul G); 
end 


% Initialization of the neural network. 

[w3,w2,b2,w 1,b1]=initial2(1,20,20,5); 

% Training the neural network, 

for t=4:kmax-1 
u=[ul (t-2) ul (t-3) y(t-2) y(t-3) (y(t)-0.8* y(t-1))7's 
utm 1 (t-1)=net2(w3,w2,b2,w1,bl,u); 
[w3,w2,b2,w1,b1J=bp2(w3,w2,b2,w1,bl,u,ul(t-1),mu); 


end 


% Plots of the desired signals. 


subplot(221) 

axis([O 200 -4 4]); 
plot(ul(1:200)); 
titleRANDOM INPUT’); 
grid 


subplot(222) 

axis({0 200 -4 4]); 

plot(utm1(1:200)); 

utle(OUTPUT OF THE NEURAL NETWORK’); 
grid 


subplot(223) 
axis({0 1000 -0.1 0.1]); 
plot(v(1:1000)); 





title (INPUT); 
grid 


subplot(224) 

axis({O 1000 -0.4 0.4]); 
plot(ym(1:1000)); 
title(DESIRED RESPONSE’); 
grid,pause 

meta h 


% Controlling the plant with a neural identifier-controller. 


clg 

clear u 

clear y 

clear t 
u=zeros(1,num); 
y=zeros(1,num); 
11=zeros(5,num); 
i2=zeros(5,num); 


for t=4:999- 
11¢,t0=[u(t-1) u(t-2) y(t-1) y(t-2) v(d))'s 
u(t)=net2(w3,w2,b2,w1,bl,il(:,t); 
y(t)=(y(t- 1) *y(t-2)/(1+y(t-1)42+y (t-2)42))+u(t- 1); 
i2¢,t=[u(t-2) u(t-3) y(t-2) y(t-3) (y(t)-0.8* y(t-1))]s 
(w3,w2,b2,w1,b1]=bp2(w3,w2,b2,w1,b1,12(:,t),u(t-1),mu); 
end 


% Ploting the results. 


subplot(111) 

n=1:999; 

axis([O 1000 -0.6 0.6]); 

plot(n,ym(1,1:999),n,y(1,1:999),"."); 

title(DESIRED OUTPUT {--} vs ACTUAL OUTPUT {..}'); 
grid,pause 

meta hh 


TeV Yo Vo Po Te Te Ne Fe Te Te Ue Po To To To Te To ‘Te We Le ‘Fo Lo Fo To Po To To To To Po Te Wo Ve We Vo Ue Fe Fo To Ve Vo Le %e 
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Jo To To Ve Fe To Yo To Ne Vo Vo Ve Vo To Vo %e To Vo Fo Vo Lo Te Lo Lo Vo To Fo Wo Lo Fo Wo Yo To Vo Vo To Lo Yo To Ve Yo Wo Vo Ue 
% 
function [W3,W2,b2,W1,b1]=initial2(ny,M2,M1,nu) 


% 

K initializes the neural network 

% 

% ny=number of outputs, nu=number of inputs, 
% M2,M |=number of neurons in hidden layers 
% 


rand(‘normal') 
W3e=rand(ny,M2); 


W2=rand(M2,M1); b2=rand(M2,1); 
W |=ones(M 1,nu); bl=rand(M1,1); 
end 


Fe VoVo Te Ve To To Ue Ve To To To Vo Yo Lo To To To Yo Vo Yo Vo Yo Vo To Vo To Yo Vo Yo Vo Yo Vo Ue Yo Vo Te To Te Vo Te We Ve Po 


FeToSe Vo Vo Ue Le Le Te Yo Ve Vo To Yo Vo To Yo Vo Ve To Yo Fo To Vo Yo Vo To Vo Yo Yo Vo Vo Vo Ke Le Vo Vo Vo Ne ‘Te Ve Se Ge To 
% 
function y=net2(W3,W2,b2,W 1,bl,u) 

%X 

% sets up the data structure for 2 layer neural network 
%  u=input, y=output 

%  We=weights, b=biases 

% 

v=sigmoid(W 1*u+b1); 

z=sigmoid(W2*v+b2); 

y=W3*z; 

end 


Ge Dee Vo Ve Ve Fe Le Lo Te To To Vo Yo Yo To Yo Lo Yo Yo To Lo To Yo Vo Fo Lo Yo Lo To Yo To Ge To Vo Ve Fo To Le Ue Yo Le We 


oFe Te To Pe Te Fe Ke Ye Te Fe Yo To Vo Yo Ve To Vo To Yo To Fe To Yo To To To Vo Lo Ve Yo Ke ‘To Ho Te Ke Fo Le Fe Ue Ue Pe Ve Ge 
% 

function [W3,W2,b2,W 1,b1]=bp2(W3,W2,b2,W 1,b1,u,yd,mu) 

% 

%. updates the weights and the biases of a2 layer neural network with backpropagation 
% 

% wis the input to the network, 

% yd is the desired reference model to track 

% mu is the learning rate 


vb=W1*u+bl; v=sigmoid(vb); 
zb=W2*v+b2; z=sigmoid(zb); 
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y=W3*z; 

e=y-yd; 

d3=e; 
d2=diag(W3'*d3)*dsig(zb); 
dl=diag(W2'*d2)*dsig(vb); 


W3=W3-mu*d3*z'; 


W2=W2-mu*d2*v'; b2=b2-mu*d2; 
W1=W1-mu*d1*u’; bl=bl-mu*d1; 
end 


ToT Fe Te To Fe To To To To To To No Yo To To Yo Ve Yo To Vo Yo To To Yo Yo Yo To To Yo To To To To To To To To Ve Ve To Yo Vo Pe 


ToT Fo Ve Te To Te Ke To To Lo Yo To To Yo To To To To To To To To Yo Vo Yo To To Yo To To Vo Vo Yo To Vo Yo To Ve Fo To Yo To Fe 
% 

function y=sigmoid(x) 

%e 

% introduces the nonlinearity to neural network 

% 

x=min(x,100); x=max(x,-100); 

y= (1-exp(-x))./(1 +exp(-x)); 

end 


FV Te Ve Ve Ve Ve Fe To To To To Yo To To To Vo Yo Te To To To To To To Yo To Vo To Vo Yo Vo Yo Yo Vo Vo Vo To To Yo Vo To Fo Ve 


FeVDo Te Te Ke Ve To Lo To To Vo Vo Yo To To To Yo To To Vo Lo To To To To To Yo To To Lo To Yo Vo To Yo To Te Fo To To To Lo To 
Jo 


function d=dsig(x) 


% 

% takes the derivative of sigmoid 
% 

% x isa vector 

% 


x=min(x,100); x=max(x,-100); 
temp=exp(-x); 

d=temp ./ (1+2*temp + temp .* temp); 
end 


Fe Ve Ve Ve Ve Pe Lo Ue Le Te Fo Te Vo Vo To To To To Yo Uo To To We Yo Fo To To To Yo Yo To To Yo Uo To To To Yo To Vo To Ge Yo Ge 
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